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I. INTRODDCTION AND REVIEW OF RELEVOT liITESATDRE 



A. INTRODDCTION 



This thesis is an investigation of n 
predicting the rate of reenlistment in the arm 
specifically the Navy. Since the advent of the al 
force in 1973, one of the major concerns of the m 
been the retention of qualified personnel beyond 
enlistment. Referred to as first term reenlistn 
decision has been the object of extensive study a 
by each of the services. The vast majority of tn 
centered around the formulation of causal moi 
heavy emphasis on economic factors. During pe 
reenlistments have been below the required lev 
models have been quite good at capturing the ef 
economic factors used to suggest the level o 
compensation. Over the past four years, this si 
changed to the point where the services are face 
large numbers of personnel desiring to rema 
services, that high reenlistment is actually L 
numbers of personnel that can be enlisted for s 
at the recruiting stations. As always, there is 3 
for more personnel in the nuclear related ratia 
of the less technical fields are approaching an 
where only a limited number of billets may be a 
new accessions. 

The object of this thesis is to attempt to 
short term model that could aid in predicting 
reenlistments for five selected Navy ratings. Ini 
will be developed utilizing the Box-Jenkins meth 
series analysis. These initial models* will 
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predict the level of reenlistaent for the selected ratings. 
Next, a leading indicator model will be developed utilizing 
the national unemployment rate for 20-24 year olds. Then, a 
refined forecast will be developed combining both time 
series and causal models. 

The potential advantage of time series analysis lies in 
its simplicity and lack of reliance on external factors. To 
many, this is viewed as a shortcoming since it cannot 
explain causal relationships such as the effect of adver- 
tising dollars on sales for a company. However, the effec- 
tiveness of advertising dollars is not easy to determine and 
in many cases, may lead to false conclusions when used in 
classical regression analysis. Other errors such as the 
autocorrelation of an independent variable or the raulticcl- 
linearity of several variables are also avoided in the use 
of time series analysis. Since time series analysis relies 
on its ability to reproduce itself over time, this allows it 
to be free of the errors of regression and still retain the 
ability to adequately forecast events: or ia this case 

levels of reenlistment. A more thorough explanation of the 
Box-Jenkins method is presented in Appendix B. 

If a time series model is accurate at predicting changes 
in trends as well as levels of reenlistment, then it may 
also be useful as a tool to adjust the levels of reenlist- 
ment bonuses to the most cost effective level aecesarry to 
retain the desired force levels. If a model can accurately 
foresee a significant increase in reenlistments, indepen- 
dent of the reenlistment bonus, the bonus leirel for that 
rating can be scaled down appropriately to retain personnel 
without the payment of an economic rent. (Rent, in this 
case, is the payment of a bonus for a decision independent 
of the bonus award level. ) To summarize, time series 
analysis may not prove to be a panacea in forecasting reen- 
listment rates but it most definitely is a tool that 
deserves wider consideration and application. 
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B. REVIEW OF RELEVANT LITERATDRE IN THE FIELD 
1 . Overvi e w 

To the best of the author's knowledge, the applica- 
tion of Box-Jenkins Time Series Analysis to reenlistment 
models has not been previously attempted. A search, 
conducted by the Defense Technical Information Center 
(DTIC) , utilizing both title and subject, did not reveal any 
references to its use of Box-Jenkins modeling far reenlist- 
ment. Bepko [Ref. 1], in his thesis modeling rareer petty 
officers reviews the relevant literature concecning first 
term reenlistment and career reenlistment from 1974 to the 
publication of his thesis in 1981. This review will address 
relevant publications since that time. 

2 • The Annuali zed Cost of Leaving { ACOL) Mo i el 

This model was developed for the Navy by John Warner 
at the Center for Naval Analysis and is currently the most 
widely used model in the Navy with relation to manpower and 
personnel policy decision making. 

The model itself is a sophisticated multiple linear 
regression that attempts to capture several impoctant under- 
lying forces in the reenlistment decision. The ganeral model 
is of the form; 



C = 



M / (1 +r) + 



/(Ur) 



7j-t 



-(W^.+ R^i (egn 1.1) 



where ; 

1. C (t^n) = Net present value of pecuniary and non- 
pecuniary returns of staying in the military until 
time 'n' as compared to leaving at time 't'. 
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2. M (j) = Monetary returns to military service from 

period 'f through *n' 

3. W (n) = Lump sum payment of the present /alue of the 

expected post service civilian wages realized by 
those staying in the military until time 'n’. 

4. R (n) = Lump sum payment of the present ?alue of the 

expected retirement benefits realized by those 
staying in the military until *n' 

5. W (t) = Present value in year *t' of the expected 

civilian wages realized by those leaving the military 
in year ' t' . 

6. R (t) = Present value in year *t* of the expected 

civilian retirement payments for those Leaving the 
military in year 't* 

7. r = personal discount rate 

As can be seen, this model relies on de/elopment of 
several sub-models relating to civilian wage structure, 
future policy decisions concerning lump sum boa us payments 
and an individual’s personal discount rate over time. While 
the results obtained with this model have been superb, the 
model itself is complex and somewhat difficult. Since the 
introduction of the general model described abo/e, it has 
been further refined to include a 'taste' facta r for mili- 
tary service and a random disturbance term whici is used to 
capture the effects of sea shore rotation, poor duty station 
and family separation. These factors improve tie model but 
at a cost of ever increasing complexity. 

3- Dar lin g * s Mo del of Marine Corp En list me nt s 

Darling [ Refo 2], utilized a combination of 
Box-Jenkins time series analysis and multiple linear 
regression to predict the supply of upper mental category 
recruits to the Marine Corp. The procedure entailed develop- 
ment of three separate models using two distinct techniques. 



the first was a standard multiple regression of 
form : 

s(M)= o-Mo </[ L Ho +(a- t Mo)e ^ ] 

wher e ; 

1. S = The supply of military recruits 

2. = The military wage 

3. a = The stochastic error term 

In this phase of development, the following va: 
introduced : 

1. Civilian military pay ratio 

2. National unemployment for 16-19 year old m 

3. Monthly leads from print media 
U. Number of Marine recruiters 

5. Dummy variable to account for anomalies 1 
ular time period. 

Further analysis led to inclusion of items one, t 
from the above list in the final regression model 
Next, a totally independent model 
Box-Jenkins methodology was developed for Marine 
This was a univariate model whose purpose was to 
effect of seasonality that was missed by the 
model- The Box-Jenkins method yielded a model of 

'‘t = 4>, \-i -<P, +0O + et - 6>/e ^ 

where ; 

1, Y = The number of high school graduates 
categories I and II that enlist in the Mac 
month t- 



the logit 



(eqn 1.2) 
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. I = The autoregressive coefficient that describes the 
model 

3. 9 = The moving average coefficient that describes the 

model 

'4. e = The error term of the model. 

These two models were initially used to generate 
independent forecasts of the actual enlistment supply. Both 
models were adequate in capturing trends in the actual 
enlistment rate but were somewhat deficient in the actual 
numbers generated. At this point. Darling combinad the tech- 
niques by utilizing the Box-Jenkins method to ta<e advantage 
of the high degree of serial correlation remaiaing in his 
regression models residuals by modeling the residuals as a 
separate time series. The result of this method was to 
predict ’error’ terms to be applied to the results of the 
regression equation. This combined model was summarized by 
.eqn 1.4 shown below; 



LSVc = LSVmr + Zt’ 



(eqn 1.4) 



where; 

1. LSVc = The predicted value of the combinai model 

2. LSVmr = The predicted value using the regression 
model only 

3. Zt’ = The Box-Jenkins model of the residuals of the 
regression model 

The resulting combined forecasts were substantially 
better than either of the techniques could acaieve sepa- 
rately and was extremely accurate in capturing the general 
trend of the enlistment supply. 
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4 . Be pko * s Rear ess i oR Mo del 



Bepko [Ref. 1], concentrated his 
multiple regression model for foreca 
beyond the first enlistment. This m 
it evaluated the Navy Rating stru 
groupings and utilized an age specifi 
unemployment into the model. This was 
at ratings and unemployment together 
Bepko’ s [Ref. 1], overall fi 
ating the reenlistment decision by 
rists yielded more relevant models 
bonus payments and that unemploymen 
group was a very significant facto 
decision of careerists. 

5. The Thomas-Liao Model of Care 
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II. AP PLI CATION OF E OX- JEN KINS ANAi^SIS TO THE J NI VAR IATE 

MOD ELS 

This chapter presents the analysis of re-enlL s tmen t data 
by the Box- Jenkins technijue. Box-Jenkins analysis involves 
three steps; 

1. Identification - This involves analysis ■> f the time 
series plots of the raw data to try and discern any 
obvious trend or seasonality 

2. Estimate - This step involves analysis oc the auto- 
correlations and partial autocorrelations to provide 
an estimate for an initial model 

3. Forecast - This step involves running the models and 
generating predictions which are then ei^aluated for 
their adequacy. 

Should any model prove inadequate, the estimation of the 
model is re-evaluated to find a more suitable model. 

A. INITIAL ANALYSIS OF THE DATA SETS 

The Box-Jenkins method was applied to data sets of the 
number of first term reenlistees for the followiig ratings: 

1. Yeoman (YN) 

2. Storekeeper (SK) 

3. Operations Specialist (OS) 

4. Electronics Technician (ET) 

5. Boiler Technician (BT) 

These ratings were selected for analysis aecause they 
presented a representative mix in mental category groupings, 
varying degrees of general and specific trainiag and also 
provided sufficient numbers of reenlistments to perform 
valid analysis. 
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The data consisted of monthly summaries of 
reenlistment percentages for the subject rating 
the period from October, 1980 through September, 
data were provided by Mr. James McEwan, the statL 
the Re-enlistment Programs Development Office { 
summary of the data is shown in table I. R 
percentages for these ratings ranged from a low 
month (BT) , to a high of 9 1,2 per month (ET).i E 
noted here that the time series plots of th 
neither suggest any clear cut trends or seasonalL 
the series similar across the ratings. 

The time series plots on the following pag 2 
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t should be 
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and the horizontal 


axis is the time 


line. 
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point is 


September, 1933. 











TABLE I 

SOHSARY STATISTICS OF REENLISTMENT RATES 



RATE 


DATES 


MEAN 


ST.DS V 


YN 


10/80-8/83 


55.5 


1 1,9 


SK 


10/80-9/83 


49. 5 


13.2 


OS 


10/8 0- 9/83 


43, 6 


18.0 


ET 


10/80-9/83 


9 1,4 


0 6. 5 


BT 


10/8 0-9/83 


4 1.2 


14,7 
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B. EXAtllNATIDN OF AO TOCORRELAT ION S AND 

AOTOCORRELATIONS 

Autocorrelations describes the associatiD 
values of the same variable but at different ti 
Autocorrelation coefficients provide important 
about the structure of a time series. These c 
can be used to identify trends and possible 
within the data. [Ref. 4] 

Partial autocorrelation is used to identify 
of the relationship between current values of 
with earlier values of that same variable, while 
effects of all other time lags constant. [Ref. 4 

1 . Ye oman 

Examination of the autocorrelations for 
rating. Fig. 2.6, suggests that the data is sti 
should not reguire any transformation prior to 
ysis. The residuals are within two standard e: 
mean zero and appear to be randomly distribj 
partial autocorrelations. Fig. 2. 7, decay to z 
and appear random after this point suggesting ti 
regressive model may be appropriate. 

2 • St orek ee pe r 

Examination of the autocorrelations for 
set indicated the residuals met the randomness 
two standard errors. The shape of the residuals 
be a decaying sine curve Fig 2.8 The partial a 
tions showed no shape but dropped to zero gradual 
randomly distributed. Fig. 2.9, not suggesting aa 
model . 
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Figure 2.7 PARTIAL AO TOCOERELATIONS FOR YH RATING. 
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Figure 2.9 PARTIAL AO TOCORRZLATIO NS FOR SK RATING. 

Specialist data set again indicated that the residuals were 
randomly distributed and had characteristics which strongly 
suggested the use of some type of autoregressive operation 
in model selection. 
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Figure 2.11 PARTIAL AD TOCORRELATIONS FOR OS RATING, 


4. 


Electronics 


5 JSilliiS ians 



The data set for the Electronics Technicians exhib- 
ited a strong trend when the residuals were evaluated, in 
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addition there was some non-stationarity suggested, which 
could require di ff er encing^ to remove. Fig. 2.12 illustrates 
the shape of the ACF function. Evaluation of tie resultant 
autocorrelation showed that this process may not be neces- 
sary as the first lag exceeds -0.5 in magnitude which is a 
classic indication of an ov erdif f erenced data set. 

The shape of the autocorrelation and the partial 
autocorrelation. Fig. 2.13, suggest that an autoregressive 
or possibly a mixed model may be appropriate for evaluation. 
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Figure 2.12 AUTOCORRELATIONS FOR ET RATE NG. 



2The method of differencing converts non-stat ionary time 
series into a stationary one. It consists of subtracting 
successive values from one another and using ti ere differ- 
ence as a new time series [Ref, 4], 
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Figure 2-13 PARTIAL AO TOCOEEELATIONS FOR ET RATING. 

5 • B oiler T ec hn ici ans 

The Boiler technician rating data set jxhibited no 
strong trend in the autocorrelations. Fig. 2.14. The shape 
of the autocorrelation and partial autocorrelation. Fig, 
2.15, also strongly suggested that an autoregressive type of 
model should be considered for evaluation. This was indi- 
cated by the decaying sine wave pattern in the ACF and the 
abrupt cutoff of the value of the PACF. 



C. MODEL DEVELOPMENT 

In developing the models, the autocorrelations and 
partial autocorrelations were evaluated against representa- 
tive Box-Jenkins models of the au toregressivs (AE) and 
moving average (HA) type. A best fit model was ti en selected 
for evaluation utilizing the Minitab General Purpose 
Statistical Computing System. The results of these models 
were then evaluated to ensure the residuals wera random and 
less than two standard errors of the mean zero. If the model 



30 





1 

00 

t 

o 

1 


-C.6 -0.4 -0.2 0. 0 0. 2 0.4 


0.6 


0.8 




f - 








1 


0.450 


IXiXXXXXXXXX 






2 


0.208 


XXXXXX 






3 


-0.02 8 


XX 






4 


-0.238 


xxxxxxx 






5 


-0.31 1 


xxxxxxxxx 






6 


-0.394 


xxxxxxxxxxx 






7 


-0.360 


sxxxxxxxxx 






a 


-0. 194 


XXXXXX 






s 


0.101 


xxxx 






10 


0.122 


xxxx 






11 


0.180 


XXXXXX 






12 


0.088 


XXX 






13 


-0.009 


X 






14 


-0.080 


XXX 






15 


-0.117 


XXXX 




1 


Figure 2. 14 


AOTOCOfifiElATIONS FOR BT RATING. 




I 


00 

• 

o 

1 


-0.6 -0. 4 -0.2 0. 0 0. 2 0.4 


t 

o 


0. 8 




t 




. 1 




1 


0.450 


XXXXXXXXXXXX 






2 


0.007 


X 






3 


-0. 155 


XXXXX 






4 


-0.221 


XXXXXXX 






R 


-0. 133 


XXXX 






6 


- 0 . 22 1 


xxxxxxx 






7 


-0. 162 


XXXXX 






8 


-0.028 


XX 






S 


0.184 


XXXXXX 






10 


-0. 128 


xxxx 






11 


- 0 . 04 9 


XX 






12 


-0. 164 


XXXXX 






13 


-0.140 


xxxx 






14 


-0. 144 


XXXXX 




1 



Figure 2.15 PAB3IAL AaiOCCBfiELATIONS FOB BT BATING. 

was satisfactory at this point, the sum of squared errors 
(SSE) was evaluated to determine if it was the lowest 
possille reduction in the SSE. In the event that more than 
one model passed these tests the t-ratios were then 
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evaluated. Again, if all these tests were insignificantly 
different aaong a fanily of models the principle of parsi- 
mony was used to select the "test" model. In every model 
developed during this phase of analysis, all three of the 
criteria were met by only the model selected which made the 
choice of the correct prediction model relatively easy to 
select. 

Once these models were selected, forecasts were gener- 
ated for the period October 1983 to March 1984 and compared 
to the actual reenlistment totals for the period. If a model 
was evaluated as totally inappropriate at this point, then 
further investigation and modeling was pursued to attempt 
resolution of the prcilem. 

Table II presents a summary of the model forecasts for 
the selected ratings. Model summaries and statistics are 
presented in appendix C. 
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TABLE II 

EESQLTS OF UNIVARIATE MODEL FORECASTS 



95^ FORECAST LIMITS 
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The model selected 
AEIMA (1, 0, 0) type, 

re-enlistment percentages 
average error of .106 with 
high of .118. all of the 
confidence limits of the 
shape by the model. Fig. 
forecasts and observations. 



for the Yeoman ratinj was of the 
a comparison of the forecasted 
with the actual totals showed an 
a range from a low of -. 172 to a 
observations were within the S5% 
model and were also captured in 
2.16 graphically illistrates the 




Figure 2.16 



PLOT OF ONIVARIATE HODEL FOR YN RATING. 



2 • St orekee per Mod el 

Since the autocorrelations and partial i 
tions did not clearly suggest the selection o£ 
type as being superior to another, an iterativa 
used for selection. After discarding several firs 
MA and ARMA models it was decided to try differ 
though the data did not strongly suggest tha 
necessary. The resultant ACF and PACF satisfied 
ness criteria without indicating overdif f erencii 
of the ARIMA (0,1,1) order was then found to n 
necesarry stringency requirements for model sela 
model generated forecasts with an average error 
a range from -. 064 to -.194. It is felt that 
modeling with the actual data for the forecast p 
reduce this error even further. Fig. 2.17 ilia 
data fit with the model. 

2- O pera tions Specialist Model 

Modeling of the Operations Speciali 
resulted in the selection of an ARIMA (1,0,0) 
appropriate model. The data for the rating pr 
largest fluctuation in range over the entire data 
fluctuations have no doubt influenced tha 

predictive power of the selected model. As a r 
model yielded acceptable predictions varying from 
observations by an average of .155 and a range 
-.315. As with the previous model, successiva 
with the new observations should improve the 
power of the model. Fig. 2.18 illustrates tha 
with the actual observations. 
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Figure 2.17 PLOT OF UNIVARIATE MODEL FOR SK RATING. 

4. Elect ron ics T echni c ian Model 

A first order autoregressive model of the AEIMA 
(1,0,0) type was also selected for the electroiics techni- 
cian rating. The model produced spectacular results with an 
average error of -.025 and a range of errors from -.003 to 
-.045. It should be noted that this data set is also the 
most stable over time with an average of more thin 91 % first 
term reenlistments. While this makes the model' s job some- 
what easier, it still remains as a powerful model for 
univariate predictions. Fig. 2.19 illustrates ths fitting of 
the mcdel. 
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Figure 2.18 PLOT OF DNIVAEIATE MODEL FOR OS RATING- 

5 . B oile r T^hn ician Model 

In the case of the boiler technicians, an ASIMA 
(1,0,0) was again evaluated as the most appropriate, 
however; this model was the poorest predictor of any 
selected for evaluation. The average error was .203 with a 
range from -.328 to .232. The model also failel to capture 
the shape of the actual data which presented an jpward trend 
while the model indicated a downward tarn. Tie model is 
however acceptable for further analysis and refinement in 
the transfer function model. Fig. 2.20 illistrates the 
fitting of the data set predictions. 
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Figure 2.19 PLOT OF ONIVAEIATE MODEL FOR ET RATING. 



D. SDHMARY 

In this chapter, five acceptable univariate models have been 
developed for the respective data sets. These models were in 
most cases fairly obvious from the analysis of the autocor- 
relations and partial autocorrelations, however the process 
can be fairly time consuming and result in the pursuit of 
several "blind alleys" on the way to a workable model. In 
succeeding chapters, a transfer function model utilizing 
unemplcyment statistics for the 20-24 age goup will be 
developed. 
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Figure 2.20 PLOT OF ONIVABIATE MODEL FOR BT RATING. 
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III. THE LEADING IN DICA TOR 

A. OVERVIEW 

A leading indicator 
selected ratings was con 
univariate time series mod 
year old age group and then 
for the selected ratings. T 
then cr osscorrelated to es 
leads or lags that affect 
could provide an early warn 
of reenlistmeat and/or the 
listment. Once developed, 
tested by using the coeffic 
from the indicated lag/lead 
time period were also gen 
results compared to the u 
combined model using time 
were alsc formulated. 

B. THE DNEHPLOYMENT flODEL 

The data for unemploym 
collected from monthly pub 
Statistics. These figures c 
through September 19 83 and 
particular age grouping w 
appropriate for personnel c 
and facing the reenlistment 



, REGRESSION M2 COMBINED MO MM 



model for reenlistmsnt in the 
structed by first developing a 
el for unemployment ia the 20-24 
applying this model to the data 
he resultant model residuals were 
tablish the location of any time 
reenlistments. These indicators 
ing system of shifts In the level 
direction of the trend in reen- 
the adequacy of the model was 
lent of determination (R-sguared) 

. Forecasts for the 0: t 83-Mar 84 
erated in this process and the 
nivariate models forecasts. A 
series analysis and regression 



ent in the 20-24 age bracket was 
lications by the Bureau of labor 
over the period from October 1980 
are presented in Appendix A. This 
as selected as being the most 
ompleting their first enlistment 
decision. 
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^ • Met hod ology 

The une m ploy men t t 
utilizing the same methodo 
chapter to reenlistment tira 
models. The unemployment d 
computing the relative perc 
the next. This was done 
current period from the r 
dividing the remainder by t 
In doing this, it was fel 
better capture responses to 
rather than responses to th 
hypothesized that this woul 
service member that the jo 
ening in relation to the 
[Ref. 5]. 

The computed change 
then evaluated for a potent 
correlation and partial aut 
appeared stationary but di 
for selection. As a result 
for model selection. The mo 
suitability utilizing the 
previous chapter, that 
randomness, smallest sum o 
excess of 2.3. Several t 
moving average models an 
average models did not 
Therefore, it was decided 
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The resulting diff 
stationarity criteria and 
tics of an over diff erenced 
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demand for a particular skill 

in unemployment tima series was 
ial model by screening the auto- 
ocorrelation functions. The data 
d not suggest any obvious model 
a trial and error method was used 
del iterations were evaluated for 
same criteria described in the 
is evaluation of residuals for 

f squared errors and a t-ratio in 
rials of autoregressive models, 
d mixed autoregressive moving 
yield any positive results, 

to difference the data one time 

building process again, 
erenced data set also met the 

did not exhibit any characteris- 
data set. When evaluated at this 
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point, the autocorrelation and partial autD 
suggested that a moving average model was appropr 
model of the ARIMA (0,1,1) type met all of tti 
criteria necessarry and was therefore adopted foe 
results and specifications are presented in Fig. 
Fig. 3.4. 
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Figure 3.1 TIME SERIES PLOT FOR ONEMPLO YMES T DATA. 

A 



42 





-0. 8 -0.6 -0. 4 - 


•0.2 0-0 0.2 0.4 0.6 0.8 




1 ^ • 1 ^ 1 - - 




1 


0. 025 


XX 


2 


-0. 115 


XXXX 


3 


0. 228 


XXXXXXX 


4 


-0. 069 


XXX 


5 


-0. 083 


XXX 


6 


0.090 


XXX 


7 


-0. 045 


XX 


8 


0. 063 


XXX 


9 


0. 002 


X 1 


10 


-0. 124 


XXXX 


1 1 


-0. 038 


XX 


12 


0 . oil 


X 


13 


-0. 032 


XX 1 


14 


-0. 034 


XX 


15 


-0. 029 


XX 








Figure 3 


.2 AUTOCORRELATION 


FUNCTION FOR UNEMPLOYMENT DATA. 


1 




1 




-0.8 -0.6 -0. 4 


-0. 2 0. 0 0.2 0.4 0. 6 0.8 




• _ 1 « - 




1 


0.02 5 


XX 


2 


-0.115 


XXXX 


3 


0.23 8 


XXXXXXX 


4 


-0. 109 


XXXX 


5 


-0.017 


X 


6 


0.02 2 


XX 1 


7 


-0.028 


XX 1 


8 


0.109 


XXXX 


9 


-0.063 


XXX 


10 


-0.080 


XXX 


1 1 


-0.074 


XXX 1 


12 


0.012 


X 


13 


0.015 


X 


14 


-0.045 


XX 


15 


-0.04 3 


XX 



Figure 3.3 PARTIAL AUTOCORRELATIONS FOR UNEMPLOYMENT DATA. 
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TABLE III 

SOMMAEY OF UNEMPLOYMENT MA 1 MODEL 



NUMBER TYPE ESTIMATE ST. DEV. T-RiTIO 

1 MA 1 0.9850 0.0689 14.29 

DIFFERENCING. 1 REGULAR 

RESIDUALS. SS = 896.9 (BACKFOREC ASTS EXCLUDED) 

DF = 34 MS = 26.4 

NO. OF CBS. ORIGINAL SERIES 36 AFTER DIFFERENCING 35 
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Figure 3.4 AUTOCORRELATION OF RESIDUALS FOR UNEMPLOYMENT MODEL. 
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C. APPLICATION TO THE SELECTED RATINGS DATA SETS 
1 . Ye oman 

When the ARIHA (0,1/1) model was applied to both the 
reenlistment data for yeoman and the differenced unemploy- 
ment data set and the cross-correlation function evaluated, 
there appeared to be a relationship at the twelve month lead 
point. The value at this point was significant wien compared 
to the other points however, it was not significant in abso- 
lute terras since it was not in excess of two standard errors 
of the mean zero. This observation of magnitude holds true 
in all of the models. Fig. 3.5 shows the cross- cor relation 
function for this data set for all lead months only. 
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Figure 3.5 CROSS CORRELATION FOR YN RATING. 
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2 • Store ke epe rs 

The cross-correlation function for the storekeeper 
model indicated a possible lead indicator relationship at 
the five month and eleven month points. Both of these points 
were significant in their relationship to the other values 
but again were below the accepted level for determining 
significance. It was decided to investigate the significance 
of these points in the regression procedure. Fig. 3.6 shows 
the cross-correlation function for positive leads of this 
model . 
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Figure 3.6 CROSS CORRELATION FOE SK RATING. 
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3- Operations S pecial i sts 

The cross-correlation function again iniicated more 
than one point for possible investigation as being rela- 
tively significant. These points occurred at ths six, nine 
and twelve month points. Fig. 3.7 again shorf s the lead 
values for this model. 
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Figure 3.7 



CROSS CORRELATION FOR OS RATING 



4- Elect ron ic s Tech nic ian s 

The cross-co rrelati on function for this model also 
suggested more than one lead point for investigation, these 
occurred at the six and twelve month points but again were 
significant only in relative terras and not In absolute 
magnitude. Fig. 3.8 illustrates the lead relationship in the 
cross-correlation function. 
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Figure 3,. 8 CROSS CORRELATION FOR ET RATING. 
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5 



- Boi l er Techni cians 

Consistent with the previous models, tie model for 
Boiler Technicians also presented more than one prominent 
point for evaluation. These points occurred at the nine and 
twelve month points. Fig. 3.9 is the cross-correlation func- 
tion for the lead values of the model. 
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Figure 3,9 CROSS CORRELATION FOR BT RATING. 
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D. LINEAR EE3RESSI0N 



In order to verify the leading indicator models, a 
linear regression will be constructed using 20-24 year old 
unemployment as the independent variable and ceenlistraent 
rates for the selected ratings as the dependent variable. 
For this process to be valid, certain assunptions are 
required prior to application of the process. The first 
assumption is that there is a linear relationship between 
the variables as described by eqn. 3. 1 



Y = a + BX, e 



(eqn 3.1) 



where for each observation, Y is a random variable. The 
second assumption is that X is fixed in value and the final 
assumption is that e, the error term, has an expected value 
of zero with constant variance for all observations. It is 
further assumed that the e’s are normally distributed and 
uncorrelated. [Ref. 6] 

E. MODEL VERIFICATION 

Linear regression models were run for all of tie data sets 
using the lagged value of the unemployment data as the inde- 
pendent variable and the reenlistment data as tie dependent 
variable. The regressions were evaluated using tie R-sguared 
value, Durbin- Watson statistic and t-ratio A sunmary of the 
regression models R-sguared and Durbin Watson statistic is 
presented in Table IV. 

The regression models for all of the ratings were less 
than robust in their ability to verify the significance of 
the leading- indicator models. R-squared values ranged from a 
high of .34 for the Yeoman model to a dismal -. 02 for the 
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TABLE IV 

SUMMARY OF R-SQDARED AND DURBIN-BATSON STATISTICS 

X=UNEMPLOYMENT FOR 20-24 YEAR OLD AGE GROUP 

SELECTED EATINGS R-SQUARED VALUES 



LAGS (X) (DURBIN-WATSO N STATISTIC IN PARENS) 





YN 


SK 


OS 


ET 


BT 


1 


.265 

(1.54) 


.22 

(1.77) 


.027 

(-82) 


. 181 
(1.22) 


-.03 

(.871 


2 


.272 

(1.56) 


. 154 
(1-81) 


.078 
(- 83) 


. 141 
(1.2) 


-.03 

(.871 


3 


.30 

(1-55) 


. 199 
(1.56) 


.025 
(- 84) 


. 133 
(1.22) 


-.031 

(.84) 


4 


.349 

(1.43) 


. 163 
(1-72) 


.037 

(-81) 


.115 

(1.51) 


-.027 

(.831 


5 


.213 

(1-3) 


. 134 
(1-76) 


. 031 
(-81) 


. 041 
(1.39) 


-.023 

(.84) 


6 


.259 

(1.31) 


. 229 
(1-76) 


.006 

(.79) 


. 022 
( 1. 55) 


-.032 

(.79) 


7 


.272 

(1-25) 


.253 

(1-75) 


. 002 
(.8) 


. 003 
(1.6) 


-.035 

(.81) 


8 


.21 

(1 - 16) 


.224 

(1-6) 


. 0 04 
(.79) 


-.009 

(1.65) 


-.034 

(.74) 


9 


.148 

(1-16) 


. 183 
(1.58) 


-. 023 
(.81) 


. 024 
(2.08) 


-.027 

(.73) 


10 


.171 

(1-14) 


. 193 
(1-69) 


-. 023 
(. 79) 


. 024 
(2.08) 


-.027 

(.75) 


11 


.098 

(1-3) 


.115 

(1-97) 


-. 013 
(.74) 


-.011 
(2. 08) 


-.043 

(.75) 


12 


.203 

(1.66) 


. 130 
(2.24) 


-. 045 
(. 71) 


. 05 1 
(2. 09) 


-.042 

(.8) 



models. This procedure should yield forecasts with a better 
fit than was possible with only the univariate or regression 
models. This procedure closely parallels the work, of Darling 
[Ref. 2], in modeling the supply of recruits for the Marine 
Corp. The comlined model will be developed in the next 
section. 
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F. THE COHBINED MODEl 

The regression models in the previous section yielded 
adequate forecasts of the r eenlistmen t rate, thsy were not, 
however, significantly better than the forecasts for the 
univariate models in chapter three. Additionally, the 
residuals of the regression model exhibited strong positive 
serial correlation as indicated by the low values of the 
Durbin- Watson statistic for each model. Table I'/ summarizes 
the data for the regression models. As shown by Darling 
[Bef. 2], this enables the residuals to be constructed into 
an independent time series. Through the application of the 
Box-Jenkins method a forecast of the residuals or error 
terms can be generated and applied to the regression models 
forecasts. This procedure should yield a forecast with a 
better fit to the actual data. 



TABLE V 







REGRESSIO 


N MODELS 


FOR SELECTED 


RATINGS 


RATE 










DUR3 IN- 




LAG 


CONSTA 


NT BETA 


ST. DEV. 


R-SQR 


D WATSON 


YN 


4 


9. 05 


3.32 


9.97 


.327 


1.43 


SK 


7 


1 . 59 


3.57 


11.88 


.253 


1. 75 


OS 


4 


73.2 


-2. 08 


16. 82 


.037 


.81 


ET 


1 


73.2 


1.31 


5.53 


.181 


1.22 


BT 


5 


52.6 


-.743 


14. 93 


-.023 


.84 



The Box-Jenkins methodology as described in Chapter 
three was again applied to the sets of regression model 
residuals and appropriate models were selected. Table VT is 
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a summary of the forecasts for all three aethods and 
percentage of error for each forecast. 

The combined model resulted in improved forecasts in 
three of the five ratings with the other two showing either 
minor improvement or a slight decline (BT, OS). Et should be 
noted at this point that these two data sets ware the most 
volatile in terms of range of observations. Ta e forecasts 
for these ratings could be improved by eliminating signifi- 
cant outliers from the data sets and recomputing all of the 
models. Due to the already small size of the data sets, this 
was not considered. This problem should be corrected in 
future works in this area as more data points become avail- 
able for analysis. 
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FORECAST COMPARISONS OF ALL. THREE MODELS 
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IV „ SOMMAEY and CONCLUSIONS 



A. SUMMARY 
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listment rates at successiv 
dent variable, unemploym 
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chnigues have been examined in 
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ratings. Two distinct methods 
odels; a univariate Box-Jenkins 
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ied in predictive ability, with 
early superior to the other two, 
ror of the actual observations), 
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the least variance In reenlist- 
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ery low R-squared value of the 
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. Boiler technicians and opera- 
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ent. This indicates that an 
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. For BT' s this may be a dummy 
e -unpleasantness of the working 
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conditions or the level of their reenlistment bonuses. For 
the OS rating, it may also be the reenlistment bonus level 
or a factor accounting for the high amount of sea duty 
present in that rating when compared to others. The ratings 
of yeoman and storekeeper presented models that were 
marginal when using Eox-Jenkins or regression separately but 
presented quite good predictions when utilizing the combined 
model. 

A somewhat surprising result of the models forecasts was 
the accuracy of the regression model using 20-24 year old 
unemployment as the only independent variable. This is 
surprising in view of the low R- squared values of the models 
and the high degree of serial correllation remaining in the 
residuals as expressed by the Durbin-Watson statistic. This 
was actually the second most accurate prediction model 
outperforming the univariate Box-Jenkins model oy a slight 
margin. 

The Box-Jenkins models* performance was restricted by 
the size of the data set available. Technically, thirty or 
more observations in a data set are considerei sufficient 
but 100 or more observations are considered desirable in 
order to utilize the full predictive power of the model. 
This larger number of observations is also considered desir- 
able in terms of identifying the underlying trends and 
patterns which may not appear in a smaller set of data. In 
terms of forecasting reenlistments, it is not possible at 
this point to utilize any more data points than were avail- 
able for this study since the monthly figures ara aggregated 
by quarters after three years and only the quarterly data 
retained. 
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B. CCNCIUSIONS 



A surprising finding, for all of the ratings modeled, is the 
continued rise in reenli stments in view of the ever 
improving economy during the period. This could possibly be 
explained, in a regression model, with the introduction of 
the civilian/military pay ratio for the period or the level 
of reenlistment bonuses for a rating. This would still, 
however, not account for the 95 percent reenlistment rate 
for electronics technicians who are generally regarded as 
having the most desireable and marketable skills in almost 
any employment market. Another explanatory term could be 
introduced for "taste" for military service much as the ACCL 
model uses. In light of the world situation and recent 
events in Lebanon, Granada and the Persian Gulf this could 
be a significant explanatory factor for the continued rise 
in reenlistments. 

In terms of policy implications, the results for all of 
the models utilized indicate that high levels or first term 
retention are likely to continue in all of these ratings for 
the next six to twelve months. At this point, derisions will 
be required on how to deal with these increases in a service 
that is rapidly approaching authorized end strength. The 
longer range forecasts still seem to indicate that the 
currently favorable climate will eventually give way to an 
ever improving economy. Now would seem to be the most oppor- 
tune time to take advantage of the situation bf increasing 
the total number of personnel in the career force as a hedge 
against the future change in the demographics of the cohort 
eligible for military service. This will indjce a short 
term increase in compensation costs by increasing the inven- 
tory of career petty officers. This will evantually be 
offset by the reduction in future training and recruiting 
costs that will result from this larger career farce. 
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SUGGESTIONS FOR FUTURE RESEARCH 
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aPPENMX A 

SUMMARY OF DATA USED IM ANALYSIS 
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42.5 


28.7 
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58.6 


62. 5 
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44. 4 
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37. 9 


68. 1 


50. 
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65.9 
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47. 


66.3 


55. 8 


40.3 


53.9 


53. 2 
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67.4 


70. 2 


53.8 


82.8 


68. 8 


46. 5 


60.7 


64.6 


41.5 


55.6 


69.8 


41. 2 


66. 


70.8 


52. 6 


68.8 


72. 


52.7 


59.1 


66. 7 


48.9 


51.8 


75. 


62. 5 



U nemo loy 
ment 

77.7 12.2 13.8 

85. 1 46.9 1 2.1 

80.2 47. 4 1 1.7 

73.6 29.8 1 1.8 

88. 6 39. 9 1 1.8 

82.2 39. 4 1 1.7 

88.5 56.6 12.1 

87. 1 62. 8 12.9 

80.8 33. 7 1 2.1 

93.7 29. 8 1 1.3 

94.3 29.1 11.8 

92.5 18.5 12.1 

93.8 33.3 1 2.8 

95.9 51.7 13. 

93. 8 54. 8 1 3.5 

96.7 66.3 1 3.5 

97. 67. 7 1 4.1 

97.1 67. 14.2 

90.4 37.3 14.7 

97.6 56. 2 1 4.3 

91.2 28.9 1 4.4 

96.2 28. 9 14.5 

92.2 15.4 15.2 

85.8 17.7 15.3 

91.8 32.3 1 5.9 

93. 39. 1 18. 

97.3 47.9 -17.8 

87.9 30. 3 1 7.6 

99.5 45.1 17.8 

96.8 41.8 16.6 

98.4 46.7 1 7. 

91.9 41.9 17.6 

92. 36. 1 15.7 

95.2 37. 1 5.7 

100. 51.7 15.9 

98. 1 62. 3 1 5. 

95.9 42.6 14.8 

95.6 56.5 14.8 

95. 5 48. 13.7 

96.2 52. 14.2 

94.2 55.6 14.0 

95.9 61.8 13.8 
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DATA UTILIZED FOE ANALYSIS 
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APPMWX B 

THE BOX-JENKINS METHOD 



A. 07EEVIEW 

The Box-Jenkins procedure can be used to fit and fore- 
cast time series data by means of a general class of statis- 
tical models. An observation at a given point in time is 
modeled as a function of its past values and/or current and 
past values of the random errors, both at seasonal and non- 
seasonal lags. Box-Jenkins methodology will model a variable 
with observations equally spaced in time and no missing 
values. Sometimes it may be necessarry, before nodeling the 
series, to transform the data by taking the log function, 
square root, power of the series or to difference the series 
on a seasonal or non-seasonal basis. 

The modeling of time series data is usually done in 
three steps. First, identify a tentative molel for the 
series. Second, estimate the parameters^ and examine the 
diagnostic plots and statistics. Third, if tie model is 
deemed acceptable, utilize the procedure for forecasting. If 
the model is inadequate, return to step one and evaluate the 
time series for more appropriate models until aa acceptable 
one is found. Fig. B. 1 illustrates the steps required in 
Eox-Jenkins analysis. 

The advantage gained in using Box-Jenkins analysis is 
that it allows the data to speak for itself slice it is a 
univariate procedure and therefore does not alio# for expla- 
natory variables. The underlying more restrictive assumption 



^For most software packages applied to Box-Jenkins 
modeling, the estimation or parameters is done ai tomatically 
in the program leaving the researcher free to concentrate on 
analysis of the resulting statistics. 
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throughout all Box-Jenkins procedures is that 
series vill eventually repeat itself [Ref. 7], oc 
is some pattern underlying the data. 

B. WHAT IS A TIME SERIES 

A time series is a collection of observatioa 
sequentially over time at specific intervals su; 
days, weeks, months or years. In addition, a cer 
dence is supposed from one period to the next, 
interdependence that is of value when trying t 
future activity for a time series. Examples of 
abound in fields ranging from business to physL 
applied to analyze monthly sales for a company, 
yields on fiduciary notes or the chemical yield a 
substance in a controlled procedure. A time seri 
be used to analyze observations that are either 
continuous; by way of example, a discrete time 3 
be the closing stock price of a company and a 
time series would be the temperature at the weat 
In summary, a discrete time series is one where o 
exist at a point in time and a continuous time 
potential observations at all points in time. F 
of this discussion., only discrete time seri 
addressed. 

C. STATIONARITY 

In pursuing a time series model, the first as 
be made in tha analysis is that the data set is 
By this it is meant that the observations oscilli 
constant mean that shows no growth over time, 
about this mean are temporary and in the long 
equilibrium about the mean [Ref. 4]. 
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Figure B. 1 THE BOX-JENKINS HETHODOLOGlf . 
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A further measure of stationarity can be gaii ed from the 
autocorrelation of the time series, that is the correlation 
between successive observations from the same di ta set. An 
observation at time t, denoted by Zt, when core elated with 
an observation Zt+1 from the same data set is said to 
produce an autocorrelation. The autocorrelation is measured 
by pk and provides important information about tae nature of 
the data set. A value close to + 1 indicates a hijh degree of 
positive correlation between observations, while a value 
close to -1 indicates a high negative correlatioa . 

Most time series are not stationary and require some 
type of transformation prior to analysis. 

D. TIME SERIES PLOT 

The first step in determining whether a tine series is 
stationary or not is to construct a time series plot of the 
data which plots observations against time in an attempt to 
visually determine any obvious patterns in the data. Fig. 
B.2 illustrates the United States gross national product for 
the years 1947 through 1970 on a quarterly basis. This plot 
shows a clear upward trend in the data which indicates the 
data is not stationary, further within each year there are 
apparently recurring patterns for each quarter that repeat 
on annual basis. Finally, as time passes, the variance in 
GNP tends to become larger and more volatile. Clearly, this 
data set must be transformed prior to further analysis by 
the Box-Jenkins method. 



E. DATA TRANSFORMATICN 

To continue with the example of GNP, there are several 
possible transformations that can be used to induce statio- 
narity. The first step is to, induce a constant variance in 
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the data, this can be accomplished either through a 
logarithmic or a square root transformation. Fig. 3.3 illus- 
trates the results of a square root transform of the data. 
The trend is still clearly present but the variaoce has been 
smoothed considerably. 

Once variance has been stabilized, the next step is to 
remove the trend. There are several sophisticated regres- 
sion techniques available to accomplish this however, the 
method of differencing will be the only one addressed here. 
For a more detailed discussion of these alternative techni- 
ques, the reader is directed to Hakridakis and Wheelright 
[Ref. 4]. 

The method of differencing a time series consists of 
subtracting the values of the time series from each other in 
a specified order. By way of example, consider tae following 
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Figure B.3 SQUARE ROOT TRANSFORMATION OF U. S. 3NP 1947 - 1970. 

data set; 1,3,5,7,9,11. When plotted, this set has an 
obvious linear trend, a first order differen'e involving 
subtraction of the first observation from the second, the 
second from the third and so on results in the transformed 
series *,2,2,2, 2,2. By taking the first difference, the 
trend disappears and yields a stationary data set. The 
indicates that whenever a data set is differenced, one 
observation is lost for each difference operator. Due to 
random fluctuations in real data, such clear cut results as 
those illustrated should not be anticipated however, for the 
majority of data sets differencing will induce a sufficient 
amount of stationarity to proceed with further analysis. 
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ADIOCOREELATION AHD PARTIAL AUTOCORRELATION 
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G. THE AUTOREGRESSIVE MODEL 



A time series is said to be governed by an 
sive, (AR) , if the current value of the time S 2 
expressed as a linear function of the previou 
values plus some error term or random shock valj 
The assumptions made here are that the da 
stationary and the error terms are normally an 
dently distributed with a mean of zero and con 
ance. A check on the' adequacy of the model is t 
an ACF for tha residuals of the model and determ 
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are random in nature. Mathematically, an AS (1) model is of 
the form: 



= (f + a^ (egn B.1) 

where (j) is egaal to the autoregressive coef f ici2 nt and a is 
the random error or shock, term. 

H. THE MOVING AVERAGE MODEL 

A time secies is said to be governed by a mo/ing average 
process if the current value of the time secies can be 
expressed as a linear function of the current ercor term and 
previous error term(s). The same restrictions i pply to the 
error terms of am MA model as applied to the AS model. 
Mathematically, this function can be expressed as; 



- 6a^ (egn B.2) 

where 6 is the moving average coefficient and a Ls again the 
error term. 

I. THE MIXED ADTOREGEESSIVE MOVING AVERAGE MODEL 

The mixed autoregressive moving average molel contains 
elements of both the AR and the MA procedures aa d expresses 
the relationship of a current observation as a linear func- 
tion of both past values and past errors of the variable. As 
with the AR and MA models, the residuals of the model are 
evaluated for adeguacy by utilizing the ACF fuaction. The 
eguation for the ARMA model is: 
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figure B,4 TYPICAL FORM OF AR1 MODEL ACF AS D PACF. 



Z 



t 




Z-t-i 



+ a. 



©a 



■c - I 



(egn B.3) 



J. EVAL0ATIN3 THE MODEL 

Once a model has been selected/ there are several ways 
to check the results for adequacy- For purposes of this 
discussion, the following checks will be addresssd; 

1 . ACF of residuals 
2- Minimum sum of squares 
3. t-ratio 

There are several other checks for adequacy that are avail- 
able to the user of Box-Jenkins methodology, for a more 
comprehensive explanation the reader is directed to Vandaele 
[Ref. '8]. 
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Figure B.5 TYPICAL FOHM OF MAI MODEL ACF AS D PACF. 

1 . ACF of The R esid uals 

As mentioned throughout this discussion, the ACF for 
a model should be random about the mean zero with constant 
variance and a magnitude less than two standard errors. 

2 . Minim um Su m of Squ a res 

Determination of this measure can only be achieved 
by comparison with other potential models. In some cases, 
several models may produce insignificantly different sums of 
sguares which will test the user's judgment and application 
of the other measures of adequacy. 

3 . t-Rat io 

In time series analysis this is computed by dividing 
the estimate of the parameter for the model by the standard 
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deviation for the series. The rules as appliei to regres- 
sion analysis still hold in that the value should be greater 
than +/-2.0 in order to indicate that the cosfficient is 
significantly different from zero. 



K. PARSIMONY 

In the event that more than one model is 
satisfying the acceptance criteria described i 
principle of parsimony will then apply. This 
when faced with several sufficient model types, 
the lowest order model available that sat 

crit eria . 

L. TRANSFER FUNCTION MODELS 

Also known as multivariate autoregressive 
moving average, (MARIMA) , or leading indicator m? 
involves selection of an appropriate univariate 
what is to be the independent variable and appL 
the dependent variable. The application of the 
result in two sets of residuals which when cross 
at different time lags will yield the cross 
function, (CCF). This differs slightly fra 

discussed earlier in that we now expect to find 
ship of significant magnitude at various poii 
comparison. This positive correlation is an indi 
there is a significant relationship between the 
and the dependent variable at certain lags or lei 
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Figure B.6 TYPICAL FORH OF ARMA 1,1 MODEL ACF AND PACF. 
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APPENDIX C 

SDKMAEY OF BOX-JENKINS MODELS OSED FOR ANALYSIS 
A. MODELS USED IN UNIVARIATE ANALYSIS 

1 . Yeo man 

Model Type - ARIMA (1,0,0) 

Model Equation 

Z = . 4482^_I + a^ (egn C.1) 

T-ratio - 2.64 

Model Residuals - Random 

2 . Storeke epe rs 

Model Type - ARIMA (0,1,1) 

Model Equation 

Z = -.8 14a^_, + a^ (eqn C.2) 

T-ratio - 7.24 

Model Residuals - Random 
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3- Ope ra tio ns S p ecial i sts 

Model Type - ARIHA (1,0,0) 

Modal Equation 

Z = . 499Z^_, + a^ (eqn C.3) 

T-ratio - 3.39 
Model Residuals - Random 
^ • Ele ct ronics Tech nic i an s 

Model Type - ARIMA (1,0,0) 

Model Equation 

Z = - 588Z^_| a^ (egn C.4) 

T-ratio - 4.18 

Model Residuals - Random 
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5. B oile r Techn icians 



Model Type - ARIMA (1,0,0) 

Model Equation 

Z = .54 1Z^_, + a^ 

T-ratio - 3.63 

Model Residuals - Random 

B. BOX-JENKINS MODELS OF REGRESSION RESIDOALS 
1 . Yeoman 

Model Type - ARIMA (0,0,1) 

Model Equation 

Z = -. 3 76a^_i * 

T-ratio - -2- 19 

Model Residuals - Random 



(eqn C.5) 



(eqn C.6) 
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2 • St oreke e pe rs 

Model Type - ARIHA (1,1,0) 

Model Equation 

Z = .498Z^_, (egn C.7) 

T-ratio - -3.00 
Model Residuals - Random 
2* Op e rat ions S pe c ial i sts 

Model Type - ARIHA (1,1,1) 

Model Equation 

Z = .604Z^_| + a^ - .956a^.j (eqn C.8) 

T-ratio - AR1 - 3.28 
MAI - 9,75 

Model Residuals - Random 
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• Electr onics Te chni c ians 

Model Type - ARIMA (1,0,1) 

Model Equation 

Z = .854Z^_, + 

T-ratio - AP. 1 - 5 
MAI - 2 

Model Residuals - 

5 • Boi ler T echn ici ans 

Model Type - ARIM 

Model Equation 

Z = -590Z^_^ +a^ ( eqn C.10) 

T-ratio - 3.70 

Model Residuals - Random 



a^ -. 529a^_j (eqn C.9) 

.21 

.04 

Random 
A (1,0,0) 
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